Industrial Scene Text Detection With Refined Feature-Attentive Network

Abstract

Detecting the marking characters of industrial metal parts remains challenging due to low visual contrast, uneven illumination, corroded character structures, and the cluttered backgrounds of metal part images. Affected by these factors, the bounding boxes generated by most existing methods locate low-contrast text areas inaccurately. In this paper, we propose a refined feature-attentive network (RFN) to solve the inaccurate localization problem. Specifically, we design a parallel feature integration mechanism to construct an adaptive feature representation from multi-resolution features, which enhances the perception of multi-scale texts at each scale-specific level to generate a high-quality attention map. Then, an attentive refinement network is developed that uses the attention map to rectify the location deviation of candidate boxes. In addition, a re-scoring mechanism is designed to select the text boxes with the best rectified location. Moreover, we construct two industrial scene text datasets, including a total of 102156 images and 1948809 text instances with various character structures and metal parts. Extensive experiments on our dataset and four public datasets demonstrate that the proposed method achieves state-of-the-art performance.
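The abstract does not say how the re-scoring step is computed. As a rough illustration only, the sketch below assumes one plausible reading: each candidate box's classifier score is multiplied by the mean attention value inside the box, so a tight box around the text outranks a loose one. The function names (`mean_attention`, `rescore`), the box format, and the toy data are all hypothetical and are not taken from the paper.

```python
def mean_attention(attn, box):
    """Average attention inside box = (x0, y0, x1, y1), end-exclusive (assumed format)."""
    x0, y0, x1, y1 = box
    values = [attn[y][x] for y in range(y0, y1) for x in range(x0, x1)]
    return sum(values) / len(values) if values else 0.0


def rescore(attn, candidates):
    """Return (box, new_score) pairs sorted by new_score, highest first.

    Assumption: new_score = classifier score * mean attention inside the box.
    """
    scored = [(box, score * mean_attention(attn, box)) for box, score in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 4x6 attention map: the "text" occupies columns 1..4 of rows 1..2.
attn = [
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 1.0, 1.0, 1.0, 0.0],
    [0.0, 1.0, 1.0, 1.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
]

# (box, classifier score) pairs; values are made up for the example.
candidates = [
    ((0, 0, 6, 4), 0.95),  # loose box with a higher classifier score
    ((1, 1, 5, 3), 0.90),  # tight box around the text region
]

best_box, best_score = rescore(attn, candidates)[0]
```

In this toy case the tight box is selected even though its raw classifier score is lower, which is the kind of behavior a localization-aware re-scoring step is meant to produce.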

Similar resources

Feature Enhancement Network: A Refined Scene Text Detector

In this paper, we propose a refined scene text detector with a novel Feature Enhancement Network (FEN) for Region Proposal and Text Detection Refinement. Retrospectively, both region proposal with only 3 × 3 sliding-window feature and text detection refinement with single scale high level feature are insufficient, especially for smaller scene text. Therefore, we design a new FEN network with ta...

Perspective Scene Text Recognition with Feature Compression and Ranking

In this paper we propose a novel character representation for scene text recognition. In order to recognize each individual character, we first adopt a bag-of-words approach, in which the rotation-invariant circular Fourier-HOG features are densely extracted from an individual character and compressed into middle level features. Then we train a set of two-class linear Support Vector Machines in...

Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees

Maximally Stable Extremal Regions (MSERs) have achieved great success in scene text detection. However, this low-level pixel operation inherently limits its capability for handling complex text information efficiently (e.g., connections between text or background components), making it difficult to distinguish texts from background components. In this paper, we propose a novel framewor...

Scene Text Area Detection from Video

Text detection from videos is a well-known research area. In particular, the detection of static superimposed text such as captions has been researched successfully, but it relies on many assumptions that call into question the applicability of those algorithms to moving scene text. In this dissertation, I propose a scene text area detection approach that includes a simple key frame extraction, feature extraction,...

Text Detection in Indoor/Outdoor Scene Images

In this paper, we propose a novel methodology for text detection in indoor/outdoor scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes indoor/outdoor scene images having shadows, non-uniform illumination, low contrast and large signal-depende...

Journal

Journal title: IEEE Transactions on Circuits and Systems for Video Technology

Year: 2022

ISSN: 1051-8215, 1558-2205

DOI: https://doi.org/10.1109/tcsvt.2022.3156390